Word Sense Disambiguation Using Target Language Corpus in a Machine Translation System

نویسندگان

  • Tayebeh Mosavi Miangah
  • Ali Delavar Khalafi
چکیده

This article studies different aspects of a new approach to word sense disambiguation using statistical information gained from a monolingual corpus of the target language. Here, the source language is English and the target is Persian, and the disambiguation method can be directly applied in the system of English-to-Persian machine translation for solving lexical ambiguity problems in this system. Unlike other disambiguation approaches, using corpora for handling the problem, which use the Most Likelihood Model in their statistical works, this article proposes the Random Numbers Model. We believe that this model is more reasonable from the scientific point of view and find that it offers the most precise and accurate results. This method has been tested for a selected set of English texts containing multiple-meaning words with respect to Persian language and the results are encouraging. ..................................................................................................................................

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Kannada Word Sense Disambiguation for Machine Translation

Polysemous Words can have more than one distinct meaning. Word sense disambiguation (WSD) is the ability to identify the exact meaning of such polysemous words in context in a computational manner. WSD is considered as an AI-complete problem, that is, a task whose solution is at least as hard as the most difficult problem in Artificial Intelligence. In this paper, we propose an Integrated Kanna...

متن کامل

Word Sense Disambiguation in Bengali applied to Bengali-Hindi Machine Translation

We have developed a word sense disambiguation(WSD) system for Bengali language and applied the system to get correct lexical choice in Bengali-Hindi machine translation. We are not aware of any existing system for Bengali WSD. Since there is no sense annotated Bengali corpus or sufficient amount of parallel corpus for Bengali-Hindi language pair, we had to use an unsupervised approach. We use a...

متن کامل

Towards A Hybrid Approach To Word-Sense Disambiguation In Machine Translation

The task of word sense disambiguation aims to select the correct sense of a polysemous word in a given context. When applied to machine translation, the correct translation in the target language must be selected for a polysemous lexical item in the source language. In this paper, we present work in progress on a supervised WSD system with a hybrid approach: on the one hand it relies on supervi...

متن کامل

Using a Target Language Model for Domain Independent Lexical Disambiguation

In this paper we describe a lexical disambiguation algorithm based on a statistical language model we call maximum likelihood disambiguation. The maximum likelihood method depends solely on the target language. The model was trained on a corpus of American English newspaper texts. Its performance was tested using output from a transfer based translation system between Turkish and English. The m...

متن کامل

Word Sense Disambiguation Based Myanmar-to-english Machine Translation System

Today, word sense disambiguation (WSD) is an important technique for many natural language processing (NLP) applications such as grammatical analysis, content analysis, information retrieval and machine translation. Among them, the WSD technique is used for machine translation to find the correct sense of a word in a specific context. In machine translation, the input sentences in the source la...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • LLC

دوره 20  شماره 

صفحات  -

تاریخ انتشار 2005